Goto

Collaborating Authors

 Sea of Japan


Explainable AI-Based Interface System for Weather Forecasting Model

Kim, Soyeon, Choi, Junho, Choi, Yeji, Lee, Subeen, Stitsyuk, Artyom, Park, Minkyoung, Jeong, Seongyeop, Baek, Youhyun, Choi, Jaesik

arXiv.org Artificial Intelligence

Machine learning (ML) is becoming increasingly popular in meteorological decision-making. Although the literature on explainable artificial intelligence (XAI) is growing steadily, user-centered XAI studies have not extend to this domain yet. This study defines three requirements for explanations of black-box models in meteorology through user studies: statistical model performance for different rainfall scenarios to identify model bias, model reasoning, and the confidence of model outputs. Appropriate XAI methods are mapped to each requirement, and the generated explanations are tested quantitatively and qualitatively. An XAI interface system is designed based on user feedback. The results indicate that the explanations increase decision utility and user trust. Users prefer intuitive explanations over those based on XAI algorithms even for potentially easy-to-recognize examples. These findings can provide evidence for future research on user-centered XAI algorithms, as well as a basis to improve the usability of AI systems in practice.


Most Japanese high school textbooks to include QR codes

The Japan Times

Almost all textbooks to be used by first- and second-year high school students in Japan from fiscal 2026 will include quick response (QR) codes that link to websites with video and audio learning aid materials, sources said Tuesday. The education ministry said the same day that a total of 253 textbooks in 13 subjects have passed the second screenings under the current curriculum guidelines. In response to the rapid progress of digitalization, many of the textbooks include descriptions on information ethics and generative artificial intelligence. The average number of pages per textbook in 11 commonly taught subjects came to 321, slightly up from the previous screenings in 2021. All geography-history and civics textbooks take up the Northern Territories, which are effectively controlled by Russia; Takeshima, the Sea of Japan islets controlled by South Korea; and the Japanese-administered Senkaku Islands, which are also claimed by China.


Beyond Turn-taking: Introducing Text-based Overlap into Human-LLM Interactions

Kim, JiWoo, Chang, Minsuk, Bak, JinYeong

arXiv.org Artificial Intelligence

Traditional text-based human-AI interactions often adhere to a strict turn-taking approach. In this research, we propose a novel approach that incorporates overlapping messages, mirroring natural human conversations. Through a formative study, we observed that even in text-based contexts, users instinctively engage in overlapping behaviors like "A: Today I went to-" "B: yeah." To capitalize on these insights, we developed OverlapBot, a prototype chatbot where both AI and users can initiate overlapping. Our user study revealed that OverlapBot was perceived as more communicative and immersive than traditional turn-taking chatbot, fostering faster and more natural interactions. Our findings contribute to the understanding of design space for overlapping interactions. We also provide recommendations for implementing overlap-capable AI interactions to enhance the fluidity and engagement of text-based conversations.


TopoFormer: Integrating Transformers and ConvLSTMs for Coastal Topography Prediction

Munian, Santosh, Karakuş, Oktay, Russell, William, Nelson, Gwyn

arXiv.org Artificial Intelligence

This paper presents TopoFormer, a novel hybrid deep learning architecture that integrates transformer-based encoders with convolutional long short-term memory (ConvLSTM) layers for the precise prediction of topographic beach profiles referenced to elevation datums, with a particular focus on Mean Low Water Springs (MLWS) and Mean Low Water Neaps (MLWN). Accurate topographic estimation down to MLWS is critical for coastal management, navigation safety, and environmental monitoring. Leveraging a comprehensive dataset from the Wales Coastal Monitoring Centre (WCMC), consisting of over 2000 surveys across 36 coastal survey units, TopoFormer addresses key challenges in topographic prediction, including temporal variability and data gaps in survey measurements. The architecture uniquely combines multi-head attention mechanisms and ConvLSTM layers to capture both long-range dependencies and localized temporal patterns inherent in beach profiles data. While all models demonstrated strong performance, TopoFormer achieved the lowest mean absolute error (MAE), as low as 2 cm, and provided superior accuracy in both in-distribution (ID) and out-of-distribution (OOD) evaluations. Accurate topographic measurements are essential for coastal applications such as flood risk assessment, erosion monitoring, habitat mapping, and navigation safety.


Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

Yu, Zhenyu

arXiv.org Artificial Intelligence

The forest serves as the most significant terrestrial carbon stock mechanism, effectively reducing atmospheric CO$_2$ concentrations and mitigating climate change. Remote sensing provides high data accuracy and enables large-scale observations. Optical images facilitate long-term monitoring, which is crucial for future carbon stock estimation studies. This study focuses on Huize County, Qujing City, Yunnan Province, China, utilizing GF-1 WFV satellite imagery. The KD-VGG and KD-UNet modules were introduced for initial feature extraction, and the improved implicit diffusion model (IIDM) was proposed. The results showed: (1) The VGG module improved initial feature extraction, improving accuracy, and reducing inference time with optimized model parameters. (2) The Cross-attention + MLPs module enabled effective feature fusion, establishing critical relationships between global and local features, achieving high-accuracy estimation. (3) The IIDM model, a novel contribution, demonstrated the highest estimation accuracy with an RMSE of 12.17\%, significantly improving by 41.69\% to 42.33\% compared to the regression model. In carbon stock estimation, the generative model excelled in extracting deeper features, significantly outperforming other models, demonstrating the feasibility of AI-generated content in quantitative remote sensing. The 16-meter resolution estimates provide a robust basis for tailoring forest carbon sink regulations, enhancing regional carbon stock management.


A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning

Hu, Lijie, Liu, Liang, Yang, Shu, Chen, Xin, Xiao, Hongru, Li, Mengdi, Zhou, Pan, Ali, Muhammad Asif, Wang, Di

arXiv.org Artificial Intelligence

Chain-of-Thought (CoT) holds a significant place in augmenting the reasoning performance for large language models (LLMs). While some studies focus on improving CoT accuracy through methods like retrieval enhancement, yet a rigorous explanation for why CoT achieves such success remains unclear. In this paper, we analyze CoT methods under two different settings by asking the following questions: (1) For zero-shot CoT, why does prompting the model with "let's think step by step" significantly impact its outputs? (2) For few-shot CoT, why does providing examples before questioning the model could substantially improve its reasoning ability? To answer these questions, we conduct a top-down explainable analysis from the Hopfieldian view and propose a Read-and-Control approach for controlling the accuracy of CoT. Through extensive experiments on seven datasets for three different tasks, we demonstrate that our framework can decipher the inner workings of CoT, provide reasoning error localization, and control to come up with the correct reasoning path.


Locating and Extracting Relational Concepts in Large Language Models

Wang, Zijian, White, Britney, Xu, Chang

arXiv.org Artificial Intelligence

Relational concepts are indeed foundational to the structure of knowledge representation, as they facilitate the association between various entity concepts, allowing us to express and comprehend complex world knowledge. By expressing relational concepts in natural language prompts, people can effortlessly interact with large language models (LLMs) and recall desired factual knowledge. However, the process of knowledge recall lacks interpretability, and representations of relational concepts within LLMs remain unknown to us. In this paper, we identify hidden states that can express entity and relational concepts through causal mediation analysis in fact recall processes. Our finding reveals that at the last token position of the input prompt, there are hidden states that solely express the causal effects of relational concepts. Based on this finding, we assume that these hidden states can be treated as relational representations and we can successfully extract them from LLMs. The experimental results demonstrate high credibility of the relational representations: they can be flexibly transplanted into other fact recall processes, and can also be used as robust entity connectors. Moreover, we also show that the relational representations exhibit significant potential for controllable fact recall through relation rewriting.


CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News

Zhu, Mengna, Xu, Zijie, Zeng, Kaisheng, Xiao, Kaiming, Wang, Mao, Ke, Wenjun, Huang, Hongbin

arXiv.org Artificial Intelligence

Extracting structured event knowledge, including event triggers and corresponding arguments, from military texts is fundamental to many applications, such as intelligence analysis and decision assistance. However, event extraction in the military field faces the data scarcity problem, which impedes the research of event extraction models in this domain. To alleviate this problem, we propose CMNEE, a large-scale, document-level open-source Chinese Military News Event Extraction dataset. It contains 17,000 documents and 29,223 events, which are all manually annotated based on a pre-defined schema for the military domain including 8 event types and 11 argument role types. We designed a two-stage, multi-turns annotation strategy to ensure the quality of CMNEE and reproduced several state-of-the-art event extraction models with a systematic evaluation. The experimental results on CMNEE fall shorter than those on other domain datasets obviously, which demonstrates that event extraction for military domain poses unique challenges and requires further research efforts. Our code and data can be obtained from https://github.com/Mzzzhu/CMNEE.


Shin-Etsu Chemical to build new chip materials plant in Gunma

The Japan Times

Shin-Etsu Chemical said Tuesday that it will build a new semiconductor materials plant in the city of Isesaki, Gunma Prefecture, at a cost of some 83 billion. The plant, slated to be completed by 2026, will make photoresists, including extreme ultraviolet resists used for state-of-the-art chips for generative artificial intelligence systems, and other semiconductor-related materials. The investment includes the cost to buy a 150,000-square-meter site for the factory. It will be the Japanese company's first new domestic production base since its plant in the city of Kamisu, Ibaraki Prefecture, was built in 1970. The Isesaki plant will also carry out research and development in the future. Currently, the company makes photoresists and related products at its plants in the prefectures of Niigata and Fukui, both along the Sea of Japan, and in Taiwan.


Conceptual and Unbiased Reasoning in Language Models

Zhou, Ben, Zhang, Hongming, Chen, Sihao, Yu, Dian, Wang, Hongwei, Peng, Baolin, Roth, Dan, Yu, Dong

arXiv.org Artificial Intelligence

Conceptual reasoning, the ability to reason in abstract and high-level perspectives, is key to generalization in human cognition. However, limited study has been done on large language models' capability to perform conceptual reasoning. In this work, we bridge this gap and propose a novel conceptualization framework that forces models to perform conceptual reasoning on abstract questions and generate solutions in a verifiable symbolic space. Using this framework as an analytical tool, we show that existing large language models fall short on conceptual reasoning, dropping 9% to 28% on various benchmarks compared to direct inference methods. We then discuss how models can improve since high-level abstract reasoning is key to unbiased and generalizable decision-making. We propose two techniques to add trustworthy induction signals by generating familiar questions with similar underlying reasoning paths and asking models to perform self-refinement. Experiments show that our proposed techniques improve models' conceptual reasoning performance by 8% to 11%, achieving a more robust reasoning system that relies less on inductive biases.